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Data clustering: a review 

A. K. Jain, M. N. Murty, P. J. Flynn 

September 1999 ACM Computing Surveys (CSUR), volume 3i issue 3 

Full text available: ■gpdjl5M.24.KB} Additional Information: fei!.cit3tion, abstract, Merences, citirigs, index,t= 

Clustering is the unsupervised classification of patterns (observations, data items, or feature vect 
(clusters). The clustering problem has been addressed in many contexts and by researchers in mi 
reflects its broad appeal and usefulness as one of the steps in exploratory data analysis. However 
difficult problem combinatorially, and differences in assumptions and contexts in different commu 
transfer of useful generic co ... 



Keywords: cluster analysis, clustering applications, exploratory data analysis, incremental cluste 
indices, unsupervised learning 



Picture Processir^g by Computer 

Azriel Rosenfeld 

September 1969 ACM Computing Surveys (CSUR), Volume 1 Issue 3 

Full text available: pdff;2.59 rviB) Additional Information: fi.iil citation, references , citings, index terms 



Access methods for text 

Chris Faloutsos 

March 1985 ACM Computing Surveys (CSUR), Volume 17 issue i 

Full text available: ■gpc[f(2J9.M.B] Additional Information: fvlll-cMtion. abstract, references, cjtings, index„t 

This paper compares text retrieval methods intended for office systems. The operational requirenr 
environment are discussed, and retrieval methods from database systems and from information r 
are examined. We classify these methods and examine the most interesting representatives of ea 
to speed up retrieval with special purpose hardware are also presented, and Issues such as appro: 
matching and compression are discussed. A quali ... 

Fast detection of communication patterns in distributed executions 

Thomas Kunz, Michiel F. H. Seuren 

November 1997 Proceedings of the 1997 conference of the Centre for Advanced Studies on C 

research 

Full text available: '^.pdH-.SI Additional Information: fv!!i.cftation, abstract, references, index.terms 

Understanding distributed applications is a tedious and difficult task. Visualizations based on proo 
are often used to obtain a better understanding of the execution of the application. The visualizat 
Poet, an event tracer developed at the University of Waterloo. However, these diagrams are often 
do not provide the user with the desired overview of the application. In our experience, such tool; 



occurrences of non-trivial commun ... 

Fi|e„seryers„fpine^^^^ 

Liba Svobodova 

December 1984 ACM Computing Surveys (CSUR), Volume 16 Issue 4 

Full text available: ^ pdl^4.23 IVIB) Additional Information: full citation, references, citings, jndex tem^s. rev 



^ Database. partitjonin a cluster of processors 

Domenico Sacca, Gio Wiederhold 

March 1985 ACM Transactions on Database Systems (TODS), volume lo issue i 

Full text available: '^ pdf(2.39 MB) Additional Information: foil citation, abstract, references, citings, index t: 

In a distributed database system the partitioning and allocation of the database over the processc 
network can be a critical aspect of the database design effort. In this paper we develop and evalu 
perform this task in a computationally feasible manner. The network we consider is characterized 
communication bandwidth, considering the processing and input output capacities in its processor 
is typical if the processors are ... 
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Cecil L. Smith 

September 1970 ACM Computing Surveys (CSUR), Volume 2 Issue 3 

Full text available: pdf(2.11 MB) Additional Information: full citation , references, citings, index terms 



Comparison of access methods for time-evolving data 

Betty Salzberg, Vassilis J. Tsotras 

June 1999 ACM Computing Surveys (CSUR), Volume 31 Issue 2 

Full text available: ^.pdf{52Q.53.KB) Additional Information: Mi .citation, abstract, references, citings, jndex t 

This paper compares different indexing techniques proposed for supporting efficient access to tenr 
comparison is based on a collection of important performance criteria, including the space consun 
processing, and query time for representative queries. The comparison is based on worst-case an 
assumptions on data distribution or query frequencies are made. When a number of methods hav 
asymptotic worst-case behavior, features in the methods tha ... 

Keywords: I/O performance, access methods, structures, temporal databases 



Special jMuej„Aj„in.8 

D. Sriram, R. Joobbani 

January 1985 ACM SIGART Bulletin, issue 9i 

Full text available: "^^dftSJiLMBi Additional Information: fuil citation, abstract 

The papers in this special issue were compiled from responses to the announcement in the July 1! 
SIGART newsletter and notices posted over the ARPAnet. The interest being shown in this area is 
sixty papers received from over six countries. About half the papers were received over the comp 

Voice response systems 

D L, Lee, F H. Lochovsky 

December 1983 ACM Computing Surveys (CSUR), Volume is issue 4 

Full text available: ■ffij.d52,22MB). Additional Information: M. citation, Merences^ index Mms. 



.SyMem..architecture^^ 

John W. Gordon 

June 1985 ACM Computing Surveys (CSUR), volume 17 issue 2 



Full text available: pclf(4.61 MB) 



Additional Information; fuii citation, abstract, references, citings, index t 



Computer music is a relatively new field. Wfiile a large proportion of the public is aware of compu 
form or another, there seems to be a need for a better understanding of its capabilities and limita 
synthesis, performance, and recording hardware. This article addresses that need by surveying ar 
architecture of existing computer music systems. System requirements vary according to what th 
used for. Common uses for co ... 

^ 2 Speech .synthesis fo^^^ 

William R. Sanders, Gerard V. Benbassat, Robert L. Smith 

February 1976 Proceedings of the ACM SIGCSE-SIGCUE technical symposium on Computer sc 

education, volume 2 , 8 issue SI , 1 
Full text available: 'g ^pdf(1.03 MB) Additional Information: liiil citation, abstract , references , index terms 

The Institute for Mathematical Studies in the Social Sciences at Stanford (IMSSS) has developed j 
MISS (Microprogrammed Intoned Speech Synthesizer), designed to test the effectiveness of comf 
speech in the context of complex CAI programs. No one method of computer controlled speech pr 
completely satisfactory for all the uses of computer-assisted instruction (CAI). The choice of syntf 
strongly related to the Icinds.of curriculums and in ... 

^ EyMuMon . pl acceM.meth 

F. Rabitti, J. Zizka 

July 1984 Proceedings of the 7th annual international ACM SIGIR conference on Researc 

development in information retrieval 

Full text available: pdf(954.82 KB) Additional Information: fuli citation , abslrad . references 

This paper compares two different approaches for indexing archived text documents. The first apf 
inversion of words in the text, the second on the generation of a signature file representing the te 
system reflecting the word inversion approach is compared against two systems reflecting the sig 
approach and using, alternatively, superimposed coding and the concatenation of word signatures 
estimated using analytical models of these sys ... 

* A . specjfLcMon. of jnVJ A L 

Christopher J. Shaw 

December 1963 Communications of the ACM, volume 6 issue 12 

Full text available: '^ pdf(1.93 MB) Additional Information: fLili citation, references , citinos 




A history of the Promis technoiogy: an effective human interface 

Jan Schultz 

January 1986 Proceedings of the ACM Conference on The history of personal workstations 



Scientific computing systems for individuals were pioneered early at Hewlett-Packard, beginning v 
Desktop Calculator in 1968. Extensions of this first machine were soon seen in Personal Periphera 
Tape Cartridges, and Plotters, and followed by Graphic CRT Displays. By early 1972, the Desktop 
augmented by a very powerful Pocket Calculator, the ground-breaking HP 35A.This paper traces t 
these machines to the present day, ... 

Conference abstracts 

January 1977 Proceedings of the 5th annual ACM computer science conference 

Full text available: ■^.p.df(3,14MB) Additional Information: Ml citation, abstract, indexjerms 

One problem in computer program testing arises when errors are found and corrected after a porl 
have run properly. How can it be shown that a fix to one area of the code does not adversely affe 
another area? What is needed is a quantitative method for assuring that new program modificatio 
new errors into the code. This model considers the retest philosophy that every program instructii 
possibly be reached and tested from the ... 




Full text available: 'MM(2M.MB) 



Additional Information: .fejj.cjtation, abstract,, rexereaces, index.ternis. 



Data base directions: the next steps 

John L. Berg 



November 1976 , Volume 8 , 8 issue 4 , 2 



Full text available: pdff9.95 MS) Additional Infornnation: fuW citation, abstract 

What information about data base technology does a manager need to make prudent decisions at 
technology? To provide this information the National Bureau of Standards and the Association for 
Machinery established a workshop of approximately 80 experts in five major subject areas. The fi' 
were auditing, evolving technology, government regulations, standards, and user experience. Eac 
report contained in these proceedings. The proceedings p ... 

Keywords: DBMS, auditing, cost/benefit analysis, data base, data base management, governmei 
management objectives, privacy, security, standards, technology assessment, user experience 



jnipiementj^ng.rankm^^ 

W. Bruce Croft, Pasquale Savino 

January 1988 ACM Transactions on Information Systems (TOIS), volume 6 issue i 

Full text available: ' gpdtn.59 MB) Additional Information: full citation, abstract, references, citings, index t 

Signature files provide an efficient access method for text in documents, but retrieval is usually lii 
documents that contain a specified Boolean pattern of words. Effective retrieval requires that doci 
meanings be found through a process of plausible inference. The simplest way of implementing ti- 
ls to rank documents in order of their probability of relevance. In this paper techniques are descri 
implementing probabilistic ranking ... 

Declusterlng.pf,keyrb^ 

Paolo Ciaccia, Paolo Tiberio, Pavel Zezula 

September 1996 ACM Transactions on Database Systems (TODS), Volume 21 Issue 3 

Full text available: pdf{2.58 MB) Additional Information: t-jll citation, abstract, references, citings, index [• 

Access methods based on signature files can largely benefit from possibilities offered by parallel e 
this end, an effective declustering strategy that would distribute signatures over a set of parallel i 
has to be combined with a synergic clustering which is employed to avoid searching the whole sig 
executing a query. This article proposes two parallel signature file organizations, Hamming Filter ( 
Keywords: error correcting codes, information retrieval, parallel independent disks, partial matcl 
performance evaluation, superimposed coding 



.CgmpytationaistM 

Paul Suetens, Pascal Fua, Andrew J. Hanson 

March 1992 ACM Computing Surveys (CSUR), Volume 24 Issue 1 

Full text available: 'g!|pdff6.37 MB) Additional Information: KiW ciiation, abstract, references, citings, index t 

This article reviews the available methods for automated identification of objects in digital images 
are classified into groups according to the nature of the computational strategy used. Four classe* 
the simplest strategies, which work on data appropriate for feature vector classification, (2) meth 
models to symbolic data structures for situations involving reliable data and complex models, (3) 
models to the photometry and ... 

Keywords: image understanding, model-based vision, object recognition 
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